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This paper is a continuation of our research work on state complexity of combined operations. Moti- 
vated by applications, we study the state complexities of two particular combined operations: catena- 
tion combined with star and catenation combined with reversal. We show that the state complexities 
of both of these combined operations are considerably less than the compositions of the state com- 
plexities of their individual participating operations. 

1 Introduction 

It is worth mentioning that in the past 15 years, a large number of papers have been published on state 
complexities of individual operations, for example, the state complexities of basic operations such as 
union, intersection, catenation, star, etc. (5l|7j|9l[T0l[T4l[T6l[T7l[T8l, and the state complexities of several 
other operations such as shuffle, orthogonal catenation, proportional removal, and cyclic shift (2l [3j ffl 
[TlTl . However, in practice, it is common that several operations, rather than only a single operation, are 
applied in a certain order on a number of finite automata. The state complexity of combined operations 
is certainly an important research direction in state complexity research. The state complexities of a 
number of combined operations have been studied in the past two years. It has been shown that the state 
complexity of a combination of several operations are usually not equal to the composition of the state 
complexities of individual participating operations ll6l [T2l[T3l[j31l . 

In this paper, we study the state complexities of catenation combined with star, i.e., L\L\, and rever- 
sal, i.e., LiLf , respectively, where L\ and L2 are regular languages. These two combined operations are 
useful in practice. For example, the regular expressions that match URLs can be summarized as L\L\. 
Also, the state complexity of L\h\ is equal to that of catenation combined with antimorphic involution 
{LiQ{Lq)) in biology. An involution function 6 is such that 6 2 equals the identity function. An antimor- 
phic involution is the natural formalization of the notion of Watson-Crick complementarity in biology. 
Moreover, the combination of catenation and antimorphic involution can naturally formalize a basic bi- 
ological operation, primer extension. Indeed, the process of creating the Watson-Crick complement of a 
DNA single strand w\M>2 uses the enzyme DNA polymerase to extend a known short primer p = 9(\V2) 
that is partially complementary to it, to obtain 6{w2)Q{w\) = 6{w\W2)- This can be viewed as the cate- 
nation between the primer p and 6(wi). The reader is referred to (TJ for more details about biological 
definitions and operations. 

It has been shown in lfl8l that (1) the state complexity of the catenation of an m-state DFA language 
(a language accepted by an m-state minimal complete DFA) and an n-state DFA language is m2 n — 2 n ~ x , 
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(2) the state complexity of the star of a &-state DFA language, where the DFA contains at least one final 
state that is not the initial state, is 2 k ~ l + 2 k ~ 2 , and (3) the state complexity of the reversal of an /-state 
DFA language is 2'. In this paper, we show that the state complexities of L\V 2 and L\h\ are considerably 
less than the compositions of their individual state complexities. Let L\ and L2 be two regular languages 
accepted by two complete DFAs of sizes p and q, respectively. We will show that, if the g-state DFA 
has only one final state which is also its initial state, the state complexity of L\L* 2 is p2 q — 2 q ~ l ; in the 
other cases, that is when the g-state DFA contains some final states that are not the initial state, the 
state complexity of L\L\ is (3p — l)2 q ~ 2 . This is in contrast to the composition of state complexities of 
catenation and star that equals (2p — 1)2 2<7 ' +2< ' 2_1 . We will also show that the state complexity of LiLf 
is p2 q — 2 q ~ l — p + 1 instead of p2 2 '' — 2 2q ~ l , the composition of state complexities of catenation and 
reversal. 

The paper is organized as follows. We introduce the basic notations and definitions used in this 
paper in the following section. Then, we study the state complexities of catenation combined with star 
and reversal in Sections [3j and 01 respectively. Due to page limitation, we omit the proofs of Lemma [Q 
Lemma [2l Lemma [3j Lemma [U Theorem [51 and Lemma [5j We also omit the proof of Theorem [2] for the 
case when m > 2 and n > 3. We conclude the paper in Section [5J 

2 Preliminaries 

An alphabet £ is a finite set of letters. A word w G £* is a sequence of letters in £, and the empty word, 
denoted by A, is the word of length. 

An involution 8 : £ — > £ is a function such that d 2 = I where / is the identity function and can be 
extended to an antimorphic involution if, for all u,v G £*, d(uv) = 6(v)6(u). For example, let £ = 
{a,b,c} and define 8 by 8(a) = b,8(b) = a, 8(c) = c, then 8(aabc) = cabb. Note that the well-known 
DNA Watson-Crick complementarity is a particular antimorphic involution defined over the four-letter 
DNA alphabet, A = {A,C,G,T}. 

A non-deterministic finite automaton (NFA) is a quintuple A = (<2,£, 8,s,F), where Q is a finite set 
of states, s € Q is the start state, and F C Q is the set of final states, 8 : Q x £ — > 2@ is the transition 
function. If 1 8 (q, a) \ < 1 for any q € Q and a € £, then this automaton is called a deterministic finite 
automaton (DFA). A DFA is said to be complete if 8(q,a) is defined for all q € Q and a G £. All the 
DFAs we mention in this paper are assumed to be complete. We extend 8 to Q x £* — y Q in the usual way. 
Then the automaton accepts a word w € £* if 8(s, w) n F ^ 0. Two states p,q G Q are equivalent if the 
following condition holds: 8(p,w) G F if and only if 8(q,w) G F for all words w G £*. It is well-known 
that a language which is accepted by an NFA can be accepted by a DFA, and such a language is said to 
be regular. The language accepted by a finite automaton A is denoted by L(A). The reader is referred 
to (8j [191 f° r more details about regular languages and finite automata. 

The state complexity of a regular language L, denoted by sc(L), is the number of states of the minimal 
complete DFA that accepts L. The state complexity of a class S of regular languages, denoted by sc(S), 
is the supremum among all sc(L), L G S. The state complexity of an operation on regular languages is 
the state complexity of the resulting language from the operation as a function of the state complexities 
of the operand languages. For example, we say that the state complexity of the intersection of an m-state 
DFA language and an rc-state DFA language is exactly mn. This implies that the largest number of states 
of all the minimal complete DFAs that accept the intersection of two languages accepted by two DFAs of 
sizes m and n, respectively, is mn, and such languages exist. Thus, in a certain sense, the state complexity 
of an operation is a worst-case complexity. 
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3 Catenation combined with star 

In this section, we consider the state complexity of catenation combined with star. Let L\ and L 2 be two 
languages accepted by two DFAs of sizes m and n, respectively. We notice that, if the rc-state DFA has 
only one final state which is also its initial state, this DFA also accepts L 2 . Thus, in such a case, an upper 
bound for the number of states of any DFA that accepts L\L* 2 = L\L 2 is given by the state complexity of 
catenation as m2 n — 2 n ~ x . We first show that this upper bound is reachable by some DFAs of this form 
(Lemma [T]). Then, we consider the state complexity of L\L\ in the other cases, that is when the rc-state 
DFA contains some final states that are not the initial state. We show that, in such cases, the upper bound 
(Theorem [D coincides with the lower bound (Theorem [2]). 

Lemma 1. For any m>2 and n>2, there exists a DFA Aofm states and a DFA Bofn states, where B 
has only one final state that is also the initial state, such that any DFA accepting the language L(A)L(B), 
which is equal to L(A)L(B)*, needs at least m2 n — 2" _1 states. 

Note that, if n = 1, due to Theorem 3 in |[T8l . for any DFA A of size m > 1, the state complexity of a 
DFA accepting L(A)L(B) (L(A)L{B)*) is m. 

In the rest of this section, we only consider cases L\L 2 where the DFA for Li contains at least one 
final state that is not the initial state. Thus, the DFA for L 2 is of size at least 2. 

When considering the size of the DFA for L\, we notice that, when the size of this DFA is 1, the state 
complexity of L\L* 2 is 1. 

Lemma 2. Let A he a 1 -state DFA and B be a DFA of n>\ states. Then, the necessary and sufficient 
number of states for a DFA to accept L(A)L(B)* is 1. 

Now, we focus on the cases when m > 1 and n > 1, and give an upper bound for the state complexity 
of L\L\. 

Theorem 1. Let A = (<2i,£, 5i, s\,F\) be a DFA such that \Q\\ = m> 1 and \Fi \ = k\, and B = 
(<22,£, o\,s 2 ,F 2 ) be a DFA such that \Q 2 \ = n > 1 and \F 2 — {sz\\ = k 2 >\. Then, there exists a DFA of 
at most m(2 n - 1 + 2"-^- 1 ) - k^-^- 1 states that accepts L(A)L(B)*. 

Proof. We denote F2 — {^2} by Fq. Then, |Fq| = k% > 1. 

We construct a DFA C = {Q,L, 8,s,F} for the language L\L\, where L\ and Li are the languages 
accepted by DFAs A and B, respectively. Intuitively, C is constructed by first constructing a DFA B' for 
accepting L* 2 , then catenating A to this new DFA. Note that, in the construction for B', we need to add an 
additional initial and final state s' 2 . By careful examination, we can check that the states of B' are state s' 2 
and the elements in P — {0}, where P is defined in the following. As the state set we choose 

Q = {rU p I r £ R and p G P}, where 

R = {S I S = {qt}, if qi $ F\,S = {qi,s 2 }, otherwise, where g,- <G gi}, and 
P = {S\SC(Q 2 -F Q )}U{T\TCQ 2 ,s 2 €T, and T n F ^ <d} . 

If si $lF\, the initial state s is s = {s\} U {0}, otherwise, s = {si,s' 2 } U {0}. 
The set of final states F is chosen to be F = {S G Q \ S n (F 2 U {s' 2 }) / 0}. 

We denote a state in Q as {qf\ U G, where qi € Q\ and G C Q 2 U {s 2 }. Then, the transition relation 8 
is defined as follows: 

S({qj}UG,a) =Di UD 2 UD 3 , for any a G L, where 
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D\: If 8\(qi,a) = q\ € F\, D\ = {q'^s^}, otherwise, D\ = {q'j}. 
D 2 : If 4 £ G> then D 2 = { 52^2, a)}, otherwise, D 2 = 0. 
D 3 : If G = 0, D 3 = 0, otherwise, 

D f &(G,a) if &(G,a)nF o = 0, 

3 1 8 2 (G,a) U {^2} otherwise. 

We can verify that the DFA C indeed accepts L\L 2 . It is clear that each state in Q should consist of 
exactly one state in Q\ and the states in one element of P — {0}. Moreover, if a state of Q contains a final 
state of A, then this state also contains the state s 2 . 

To get an upper bound for the state complexity of catenation combined with star, we should count 
the number of states of Q. However, as we will show in the following, some states in Q are equivalent. 

Let us recall the construction for B'. Note that, in that construction, states s 2 and s 2 should reach the 
same state on any letter in E. Also note that a state of Q contains s' 2 only when it contains a final state 
of A. Moreover, there exist pairs of states, denoted by {qf,s' 2 ,s 2 } U T and {qf,s' 2 } U T, such that qf is a 
final state of A and T C Q 2 \ {s 2 }. Then, we show that the two states in each of such pairs are equivalent 
as follows. For a letter aGl and a word w E £*, 

S{{qf,s' 2 ,s 2 }U T,aw) = 8({q f ,s' 2 }UT,aw) = 8(8({q f ,s' 2 }UT,a),w). 

Note that the equivalent states are only in the set F\ x {s 2 } x {S \ S C (Q 2 — Fq)}, and we can furthermore 
partition this set into two sets as 

Fi x {s' 2 } x {^2} x {S' I 5' C (Q 2 -F - {s 2 })} U 
Fi x {s' 2 } x {S' I 5' C (& -F - {s 2 })}. 

It is easy to see that, for each state in the former set, there exists one and only one equivalent state in the 
latter set, and vice versa. Thus, the number of equivalent pairs is k\2"~ kl ~ l . 

Finally, we calculate the number of inequivalent states of Q. Notice that there are m elements in R, 
2>'-k 2 elements in the first term of P, and (2 kl — l)2 n ~ kl ~ l elements in the second term of P. Therefore, 
the size of Q is \Q\ = m(2 n ~ l + 2 n ~ kl ~ l ). Then, after removing one state from each equivalent pair, we 
obtain the following upper bound 

m(2"- 1 +2 n - k2 - l )-k 1 2"- k2 - 1 . 

□ 

Next, we give examples to show that this upper bound can be reached. 

Theorem 2. For any integers m>2 and n>2, there exists a DFA A of m states and a DFA of n states 

3 

such that any DFA accepting L{A)L{B)* needs at least ~ 2 n ~ 2 states. 

Proof. We first give an example of two DFAs A and B of sizes m > 2 and n = 2, respectively, and we show 
that the number of states of a DFA accepting L(A)L(B)* reaches the upper bound given in Theorem [T] 
We use a three-letter alphabet £ = {a,b,c}. We omit the cases when n>2, due to the page limit. 

Define A = {Q\ ,L,8i,qo, {<7m-l})> where Q\ = {qo,q\ q m -\}, and the transitions are given as: 

• 8i(qi,a) =qi+i,i £ {0, ... ,m — 2}, 8\{q m ^i,a) = q , 
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• 8i(qi,b) =qi+i,i G {0,...,m-3}, 8\(q m -2,b) = q , 8\(q m -\,b) =q m -2, 

• 8i(qi,c) =q i+ i,iE {0,...,m-3}, 8\(q m - 2 ,c) = q , 5i(g m _i,c) =q„-i. 
Define 5 = (<22,£, 52,0, {1}), where £?2 = {0, 1}, and the transitions are given as: 

82(0,0) = 1, 82(p,b)=0, 82(0,c) = 0, 
§2(1, a) =0, &(1,*) = 1, §2(1, c)=0. 

Following the construction described in the proof of TheoremQ] we construct a DFA C = (Q3 53 , J3 , F3) 
that accepts L(A)L(B)*. Note that set P only contains three elements P = {0, {0}, {0, 1}}. To prove that 
C reaches the upper bound, it is sufficient to show that 1) all the states in (?3 are reachable from S3, 2) after 
merging the equivalent states {q m -i,0'} and {^ m _i,0',0}, the remaining states arepairwise inequivalent. 

We first consider the reachability of all the states. It is clear that state {qf\ U {©}, for i G {1, ... ,m — 
2}, and state {q m -\,0'} U {0} are reachable from S3 by reading the strings a' and a m_1 , respectively. 
Then, on letters b andc, we can reach states {q m -2,0} and {^ m _i,0',0}, respectively, from state {q m -\,0'}. 
Moreover, state {g,-,0}, i G {0, . . . ,m — 3}, can be reached from state {g m -2,0} by reading the string b' +i . 
Lastly, state {qt,0, 1}, / € {0, . . . ,m — 2}, and state {^ OT _i,0',0, 1}, are reachable from {g m -i,0'} on in- 
puts a !+1 and a m , respectively. 

Since states {q m -\,0'} and {^,„_i,0',0} are equivalent, we remove state {q m -\,0' ,0} from Q$, and 
show that the rest of the states are pairwise inequivalent. Let {q;} U G and {qj} UH be two different 
states in £?3 with < i < j < m — 1. There are three cases: 

1. i < j. Then the string a m ~ l ~'c is accepted by DFA C starting from state {g,-} U G, but it is not 
accepted starting from state {qj} UH. Note that, after reading a m ~ l ~'c, state {qj} UG reaches a state that 
contains states q m -\ and 0'. In contrast, the state reached by {qi}\JH on the same input does not contain 
these states. Moreover, the resulting states cannot contain state 1 , since on letter c, C remains in state 
from state and goes to state from state 1. 

2. i = j 7^ m — 1. Since P = {0, {0}, {0, 1}} consists of only three elements, we consider them 
individually. It is obvious that, state {^,,0, 1} is not equivalent to either {qj} or {g;,0}, since it is a final 
state but the latter two are not. States {qf\ and {g,,0} are inequivalent, since on the string ab we can 
reach a final state from state {^,,0} but not from state {qf\. 

3. i = j = m — 1 . There are only two states {q m - 1 , 0' } and {q m - 1 , 0' , 0, 1 }. They are inequivalent, 
because after reading a letter b, state {q m -\ ,0' ,0, 1} leads to a final state of C but {q,„-\,0'} does not. 

Due to 1) and 2), DFA C has at least 3m + 2 pairwise inequivalent reachable states, which reaches 
the upper bound in Theorem [T] □ 

4 Catenation combined with reversal 

In this section, we first show that the state complexity of catenation combined with an antimorphic in- 
volution 6 (Li 0(1-2)) is equal to that of catenation combined with reversal. That is, we show, for two 
regular languages L\ and L2, that sc(L\Q{Lq)) = sc{L\lJ£) (Corollary [T]). Then, we obtain the state com- 
plexity of LiLf by proving that its upper bound (Theorem^ coincides with its lower bound (TheoremlU 
Theorem [2 and Lemma [5]). 

We note that an antimorphic involution 6 can be simulated by the composition of two simpler 
operations: reversal and a mapping 0, which is defined as <p(a) = 6(a) for any letter a G E, and 
0(mv) = <p(u)<p(v) where u,v G £ + . Thus, for a language L, we have 6(L) = <p(L R ) and 6(L) = (<p(L)) R . 
It is clear that is a homomorphism. Thus, the language resulting from applying such a mapping to a 
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regular language remains to be regular. Moreover, we can obtain a relationship between the sizes of the 
two DFAs that accept L and <p (L), respectively. 

Lemma 3. Let L C £* be a language that is accepted by a minimal DFA of size n, n> 1. Then, the 
necessary and sufficient number of states of a DFA to accept (j)(L) is n. 

In order to show that the state complexity of L\Q(Lq) is equal to that of L\L 2 , we first show that 
the state complexity of catenation combined with is equal to that of catenation, i.e., for two regular 
languages L\ and L 2 , sc(Li<p(L 2 )) = sc(L\L 2 ). Due to the above lemma, if L 2 is accepted by a DFA 
of size n, <$>{Lq) is accepted by another DFA of size n as well. Thus, the upper bound for the number 
of states of any DFA that accepts Li0(L 2 ) is clearly less than or equal to m2 n — 2 n ~ l . The next lemma 
shows that this upper bound can be reached by some languages. 

Lemma 4. For integers m>\ and n>2, there exist languages L\ and L 2 accepted by two DFAs of sizes 
m and n, respectively, such that any DFA accepting Li 0(1-2) needs at least m2 n — 2"~ l states. 

As a consequence, we obtain that the state complexity of catenation combined with is equal to that 
of catenation. 

Corollary 1. For two regular languages L\ and L 2 , sc(L\ty(L 2 )) = sc{L\L 2 ). 

Then, we can easily see that the state complexity of catenation combined with B is equal to that of 
catenation combined with reversal as follows. 

sc(L l d(L 2 )) = sc(L l ^(L^))=sc{L l L^). 

In the following, we study the state complexity of L\L\ for regular languages L\ and L 2 . We will first 
look into an upper bound of this state complexity. 

Theorem 3. For two integers m,n>\, let L\ and L 2 be two regular languages accepted by an m-state 
DFA with k\ final states and an n-state DFA with k 2 final states, respectively. Then there exists a DFA of 
at most m2 n — k\2 n ~ k2 (2 kl — 1) — m + 1 states that accepts L\L\. 

Proof. Let M = (Qm,^,8m,sm,Fm) be a DFA of m states, k\ final states and L\ = L(M). Let Af = 
(<2iv>£) Sn,s n ,F n ) be another DFA of n states, k 2 final states and L 2 = L(N). Let N' = (Qn, £, 8n>,F n , {s^}) 
be an NFA with k 2 initial states. d^i(p,a) =qif 8^{q,a) = p where a € £ and p,q <G Qn- Clearly, 

L(N') = L(N) R = L\. 

After performing subset construction on_/V', we can get an equivalent, 2"-state DFA A = (Qa,^, 8a,sa,Fa) 
such that L(A) = h\. Please note that A may not be minimal and since A has 2 n states, one of its final 
state must be Qn- Now we construct a DFA B = (<2s,£, <>b,sb,Fb) accepting the language L]Lf , where 

Qb = {(i,j)\ieQM,jeQ A }, 

sb = (sm,&), if s M ^F M ; 

= (sm,Fn), otherwise, 

F B = {(iJ)eQ B \jeF A }, 

8 B {{i,j},a) = (/',/), if 8 M (i,a) = i', 8 A (j,a) = f, a £ E, i' £ F M ; 

= (i',jUF N ), if 8 M (i,a) = i', 8 A (j,a) = f, fl£l, /' G F M . 

It is easy to see that 8s((i, Qn)-, ) € F B for any i G Q M and a G E. This means all the states (two-tuples) 
ending with Qn are equivalent. There are m such states in total. 
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On the other hand, since NFA N' has k 2 initial states, the states in B starting with i G F M must end 
with j such that C j. There are in total k\2 n ~ kl {2 kl — 1) states which don't meet this. 
Thus, the number of states of the minimal DFA accepting L\L R is no more than 

ml" -ki2 n - k2 (2 h -l)-m+l. 

□ 

This result gives an upper bound for the state complexity of L\L R . Next we show that this bound is 
reachable. 

Theorem 4. Given two integers m>2,n>2, there exists a DFA Mofm states and a DFA Nofn states 
such that any DFA accepting L{M)L(N) R needs at least m2 n — 2 n ~ x — m + 1 states. 

Proof. Let M = (<2m>£> §m,0, — 1}) be a DFA, where Qm = {0, 1, ... ,m — 1}, E = {a,b,c}, and the 
transitions are given as: 

• 8m{i,x) = i, i = 0, . . . ,m — \,x G {a,b}, 

• 8m(i, c) = i + 1 mod m, / = 0, . . . , m — 1 . 

Let ,/V = (<2at,£, 5^,0, {0}) be a DFA, where <2w = {0,1, ... ,n — I}, H = {a, b,c}, and the transitions 
are given as: 

• 8^(0, a) =/2 — 1 , 8M(i,a) = i — 1, i = 1,. . . ,n — 1, 

• 8^(0, b) = 1, 5jv(/,&) = i, i = 1, . . . ,n — 1, 

• <5 w (0,c) = 1, <5 w (l,c) =0, 5jv(j,c) = j, j = 2,... ,n- 1, if n > 3. 

Now we design a DFA A = (<2a,£, 5a, {0}, F A ), where g A = {q \ q C gjv}, £ = {a,6,c}, = {<? I 
<E q, q £ Qa}, and the transitions are defined as: 

5A{p,e) = {j I <5;v(./» = i, i G /?}, p eQ A ,ee I. 

It has been shown in |[T8l that A is a minimal DFA that accepts L(N) R . Let B = (2b, £ = {a,b,c}, 8b, sb 
(0,0), Fa) be another DFA, where 

Qb = {(p,q)\peQ M -{m-l},qEQ A -{Q N }}U{(0,Q N )} 
U{{m-\,q) \qeQ A -{Q N }, {0} G <?}, 

= {(P,<l) k G^A, £<2b}, 

and for each state G 2s and each letter eGE, 



S B ({p,q),e) 



(p',q'} if 8 M (p,e)=p' ^m- I, 8 A (q,e)=q' ^Q N , 
(p',q') if 8 M (p,e) =p' = m- 1, 

S A (<7,e) = r', ? ' = r'U{0},^e«, 
<0,Giv) if 5 M (p,e)=m-l,5 A ^,e) = r / ,r / U{0} = e iV , 
, (0,g w ) if 8 M (p,e) j^m- 1, <5 A (<?,e) = gw. 



As we mentioned in last proof, all the states (two-tuples) ending with Qn are equivalent. So here, we 
replace them with one state: (0, Qn)- And all the states starting with m — 1 must end with j G Q A such 
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that G j. It is easy to see that B accepts the language L(M)L(N) R . It has ml' 1 — 2 n ~ l — m + 1 states. 
Now we show that B is a minimal DFA. 

(I) We first show that every state G Qb is reachable by induction on the size of j. Let k = \j\ 
and k <n — \. Note that state (0, Qn) is reachable from state (0,0) over string c m b(ab) n ~ 2 . 

When k = 0, i should be less than m — 1 according to the definition of B. Then, there always exists a 
string w = c l such that 5 B ((O,0),w) = (/,0). 

Basis (k= 1): State (m — 1 , {0}) can be reached from state (m — 2, 0) on a letter c. State (0, {0}) can 
be reached from state (m — 1,{0}) on string ca n ~ l . Then, for i G {1, . . . ,m — 2}, state (/,{0}) is reachable 
from state (i — 1, {0}) on string ca"~ l . Moreover, for i G {0, ... ,m — 2}, state is reachable from 
state (i, {0}) on string aK 

Induction steps: Assume that all states (/, j) such that \j\ < k are reachable. Then, we consider the 
states where \ j\ = k. Let j = {71,72, • • • ,jk} such that < j\ < 72 < ■ • ■ < jk<n — l. We consider 
the following four cases: 

1. 71 = and 72 = 1. State (m — 1, {0, 1,73, . . . ,jk}) is reachable from state (m — 2, {0,73, . . . ,jk}} 
on a letter c. Then, for i G {0, ... ,m — 2}, state (/, 7) can be reached from state (m — 1, {0, 1 ,73 , ... , 7^}) 
on string c !+1 . 

2. / = 0, 71 = 0, and 72 > 1. State (0,7) can be reached as follows: 

(0, {71,72, • • • ,7*}) = <>B((m - 2, {73 - 72+ 1, . . . ,j k ~ h + - h + !}), c2 ^ 2_1 )- 

3. / = and j\ > 0. State (0, 7) is reachable from state (0, {0, 72 — 71 , ... , 74 — 71 }) over string a J1 . 

4. We consider the remaining states. For i G {1, . . . ,m — 1}, state (1,7) such that 71 = and 72 > 1 
can be reached from state (i — 1 , { 1 , 72 , . . . , jk}) on a letter c, and, for i G { 1 , . . . , m — 2}, state (i, j) such 
that 71 > is reachable from state (i, {0, 72 — 71 , ... , 7& — 71 }) over string a 71 . Recall that we do not have 
states such that i = m — 1 and j\ > 0. 

(II) We then show that any two different states and (12,72) in 2b are distinguishable. Let us 
consider the following three cases: 

1- 7i 7^ 72- Without loss of generality, we may assume that I71 1 > I72I. Let x G 71 — 72. We don't 
need to consider the case when x = 0, because, if G 71 — 72, then the two states are clearly in different 
equivalent classes. For < x < n— 1, there always exists a string t such that 5fl((ii,7i),f) G and 
$B{{h,h),t) £F B , where 

{a"-* if ?2 7^ m — 1, 71 7^ 72, 

a" - * -1 ca if ?2 = m — 1, 71 7^ 72, « > 2, 
c if i 2 = m - 1, 71 7^ 72, « = 2. 

Note that, under the second condition, after reading the prefix a n ~ x ~ l of t, state n — 1 cannot be in the 
second component of the resulting state. This is because x G" 72. 

Also note that when n = 2, j\, 72 G {Qn, {0}, {1}}, where Qn = {0, 1}. Moreover, when 12 = m — 1, 
(12,72) can only be (m — 1, {0}). Due to the definition of B, we have that, for s > 1, (j, 2a?) ^ <2g. Thus, 
it is easy to see that (11,71) is either (?'i,{l}) or (0, {0, 1}). When (11,71) = (?i,{l}), G 71 —72, so 
the two states are distinguishable. When (11,71) = (0, {0, 1}), a string c can distinguish them because 
5 B ((0,{0,l}),c) GF B and5 B ((m-l,{0}),c) <£F B . 

2- 71 = 72 7^ ii 7^ J 2- Without loss of generality, we may assume that i\ > ?2. In this case, 
ii^m — X. LetxEQx — ji. There always exists a string u = a n ~ x+l bc m ~ l ~ n such that 5g((z'i,7i),w) G F B 
and 5b((?2,72),m) ^ ^b- 
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Let (h,j[) and {h,j\) be two states reached from states and (iiji) on the prefix a n ~ x+{ of 

w, respectively. We notice that state \ ofN cannot be in j[ . Then, after reading another letter b, we reach 
states (h,j'{) an d {h, ]'{)■> respectively. It is easy to see that states and 1 of N are not in j". Lastly, 
after reading the remaining string c m ~ l ~ n from state (h,f{), the first component of the resulting state 
is the final state of DFA M and therefore its second component contains state of DFA N. In contrast, 
the second component of the resulting state reached from state (hj'i) on the same string cannot contain 
state 0, and hence it is not a final state of B. Note that this includes the case that j\ = j% = 0, i\ ^ i%. 

3. We don't need to consider the case j\ = }% = Qn, because there is only one state in Qb which ends 
with 2^. It is (0,Q N ). 

Since all the states in B are reachable and pairwise distinguishable, DFA B is minimal. Thus, any 
DFA accepting L(M)L(N) R needs at least ml" - 2 n ~ 1 - m + 1 states. □ 

This result gives a lower bound for the state complexity of L(M)L(N) R when m,n>2. It coincides 
with the upper bound when k\ = 1 and &2 = 1- In the rest of this section, we consider the remaining 
cases when either m = 1 or n = 1. We first consider the case when m = 1 and n > 3. We have L\ = or 
L\ = E*. When L\ = 0, for any L2, a 1-state DFA always accepts L\L R , since L\L R = 0. The following 
theorem provides a lower bound for the latter case. 

Theorem 5. Given an integer n>3, there exists a DFA Mof\ state and a DFA Nofn states such that 
any DFA accepting L{M)L(N) R needs at least 2"~ l states. 

Now, we consider the case when m = 1 and n = 2. 

Lemma 5. There exists a \-state DFA M and a 2-state DFA N such that any DFA accepting L(M)L(N) R 
needs at least 2 states. 

Lastly, we consider the case when m > 1 and n = 1. When L2 = 0, for any L\, a 1-state DFA always 
accepts UL\ = 0. When L 2 = I*, L X L R = L{L*, since (L*) R = I*. Due to Theorem 3 in 0JD, which 
states that, for any DFA A of size m > 1, the state complexity of L(A)L* is m, the following is immediate. 

Corollary 2. Given an integer m>\, there exists an m-state DFA M and a l-state DFA N such that any 
DFA accepting L(M)L(N) R needs at least m states. 

After summarizing Theorems [3l 0J and [51 Lemma |5J and Corollary [2j we obtain the state complexity 
of the combined operation L\L R . 

Theorem 6. For any integer m>\,n> \, m2 n — 2 n ~ i —m + \ states are both necessary and sufficient 
in the worst case for a DFA to accept L(M)L(N) R , where M is an m-state DFA and N is an n-state DFA. 

5 Conclusion 

Motivated by their applications, we have studied the state complexities of two particular combinations of 
operations: catenation combined with star and catenation combined with reversal. We proved that they 
are significantly lower than the compositions of the state complexities of their individual participating 
operations. Thus, this paper shows further that the state complexity of a combination of operations has 
to be studied individually. 
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